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New goodness-of-fit tests for Markovian models in time series analysis are developed which are 
based on the difference between a fully nonparametric estimate of the one-step transition dis- 
tribution function of the observed process and that of the model class postulated under the null 
hypothesis. The model specification under the null allows for Markovian models, the transition 
mechanisms of which depend on an unknown vector of parameters and an unspecified distribu- 
tion of i.i.d. innovations. Asymptotic properties of the test statistic are derived and the critical 
values of the test are found using appropriate bootstrap schemes. General properties of the boot- 
strap for Markovian processes are derived. A new central limit theorem for triangular arrays of 
weakly dependent random variables is obtained. For the proof of stochastic equicontinuity of 
multidimensional empirical processes, we use a simple approach based on an anisotropic tiling 
of the space. The finite-sample behavior of the proposed test is illustrated by some numerical 
examples and a real-data application is given. 
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1. Introduction 

The analysis of time series is often based on parametric or semi-parametric model as- 
sumptions that must be tested in practice. An important class of stochastic processes 
used in modelling time series is that of Markov type, which are described by a specifica- 
tion of the transition kernel, usually involving a finite-dimensional parameter vector and 
possibly a partial specification of the distribution of some innovations. 

Many available tests of parametric or semiparametric models in time series analysis 
originate from corresponding frameworks with independent and identically distributed 
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data and are based on the difference between the stationary distribution or the regres- 
sion/autoregression function and corresponding empirical counterparts. Tests proposed 
by Bierens (1982) and McKeague and Zhang (1994) focus on the conditional mean func- 
tion. For the autoregression function, an approach leading to distribution-free tests us- 
ing a martingale transformation has been proposed by Koul and Stute (1999). For an 
overview, see also Delgado and Gonzalez Manteiga (2001). Sometimes, the autoregres- 
sion function that corresponds to a hypothetical model is not explicitly known, which 
causes problems for testing schemes based on it. More importantly, however, a model 
check based on the stationary distribution or the autoregression function is not able 
to detect all types of departures from a hypothetical model. It might happen that the 
autoregression functions and/or the conditional variance functions of two processes are 
similar or identical, whereas their conditional distributions are essentially different. 

In the present paper, we derive tests of the validity of a Markov model by directly 
comparing the hypothetical, model-based conditional distribution with its model-free 
estimated empirical counterpart. Let X = [X t )t^z be the process considered and denote 
by Px. the law of X. Denote by M the class of Markov processes of order less than or 
equal to p, that is, 



where X t _i = (X t -i, . . . , X t - P )' . The problem we consider in this paper is that of testing 
the hypothesis 



In the above notation, G:K ? x6xR^Ris some known function depending on an 
unknown parameter vector 9 and (st)tez is a sequence of i.i.d. innovations with distri- 
bution F e belonging to some appropriate class of distribution functions denoted by T e . 
Note that, under the null hypothesis, the transition kernel generating X t is known up to 
the finite-dimensional parameter vector 9 and the distribution function F £ of the inno- 
vations. Testing problems fitting into this framework arc discussed in Section 2. In this 
paper, an appropriate test statistic for the above testing problem is obtained which is 
based on the supremum deviation of a fully nonparametric estimator of the conditional 
distribution function Fx t \x t _ 1 from its model-based one. If Hq were a simple hypothesis, 
that is, if 9 and F e were specified under Hq, then we could consider as a starting point 
for a test the deviation process given by 



M = {P x : P(X t G B\a(X s ,s < t j) = P(X t G B | X t _i) VB G B, Vt G Z}, 



Hq:Px.£ Mo n M 




Mo = {Px:X t = G(X t _i,0,e t ),e t ~ F e i.i.d.; 9 G 9, F e e T e }. 



1 



71 



y/n 



^I(X t -i±x)[I(X t <y)-F Ho (y\X t _ 1 )}, 



i = i 
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(1.1) 

(x,y) G W x K, 

where i* 1 ^ (y | X t _i) denotes the one-step transition distribution function of the Markov 
model class postulated under Hq and x <y means Xi < yi for all i = 1, . . . ,p. Note that 
Un\x,y) measures, for every point {x,y), the difference between the sample one-step 
transition distribution function and its model-based version under the null. As we will 
sec in the sequel, appropriate specifications of the function Fn {y | X t _i) under Hq lead 
to useful test statistics. We consider as important cases AR(p) or ARCH(p) processes 
and show that the above deviation process can be asymptotically approximated by a 
Gaussian process. Note that, although Mo is general enough, it may be of interest in some 
applications to test more restricted versions of A^o- For instance, in ARCH modelling, it 
is not uncommon to impose specific parametric assumptions on the distribution F £ of the 
innovations (cf. Engle (1982) and Bollcrslcv (1987)). Our inference procedure also allows 
for the testing of hypotheses of this type; see the discussion at the end of Section 3. 

Since the parameters of the limiting Gaussian process of the statistic considered de- 
pend on the actual stochastic process in a complicated way, we approximate the null 
distribution of the test statistic by means of a model-based bootstrap approach. For the 
cases considered, the most natural bootstrap scheme is a model-based bootstrap without 
an additional smoothing of the residuals, that is, a bootstrap based on i.i.d. residuals 
with a discrete distribution. It is known that such processes may fail to satisfy classical 
mixing conditions; see, for example, Rosenblatt (1980). However, using the alternative 
concept of weak dependence introduced by Doukhan and Louhichi (1999), we are able 
to establish some general properties of such bootstrap schemes which may be of interest 
in their own right. In a different context and for inference problems different from those 
considered here, bootstrap methods related to that proposed in this paper have been 
considered by, among others, Basawa, Green, McCormick and Taylor (1990), Rajarshi 
(1990), Paparoditis and Politis (2002) and Biihlmann (2002). 

Testing problems based on the conditional distribution function have attracted some 
interest in recent years. For i.i.d. observations, specification tests of a parametric hypoth- 
esis concerning the conditional distribution have been considered by Andrews (1997) and 
Stinchcombe and White (1998). For dependent, mostly-mixing observations, tests based 
on the conditional distribution function have also been investigated by some authors. 
Li and Tkacz (2001) proposed a test based on an i 2 -type distance between the nonpara- 
metrically estimated conditional density and its model-based parametric counterpart. 
Corradi and Swanson (2001) and Bai (2003) considered a Kolmogorov-type test. Test- 
ing problems associated with linear restrictions on the conditional distribution function 
have been considered by Inoue (1999). Although some of the above approaches deal with 
testing problems similar to our own, none of these papers covers the more general case 
described by our null hypothesis Hq. Furthermore, our analysis is based on the alterna- 
tive concept of weak dependence, which appears to be quite important in the current 
context. We investigate basic properties of model-based bootstrap approaches and, as a 
technical prerequisite for our analysis, we prove a central limit theorem for triangular 
arrays of weakly dependent random variables which does not require moment conditions 
beyond Lindeberg's. 
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The paper is organized as follows. In Section 2, we precisely state our assumptions on 
the underlying stochastic process and discuss some interesting examples of Markovian 
models which fit into our testing framework. In Section 3, the test statistic used is 
presented and its asymptotic behavior under validity of the null hypothesis is established. 
Bootstrap approximations to the distribution of the test statistic under the null are 
investigated in Section 4. Some numerical examples illustrating our theoretical analysis, 
as well as an application to financial data, are given in Section 5. In Section 6, we prove 
a central limit theorem for triangular schemes of weakly dependent random variables. 
Section 7 contains the proofs of all statistical results and a lemma which provides the 
major step in a proof of stochastic equicontinuity. 

2. Assumptions and examples of processes 

We assume that observations A"i_ p , . . . ,X n from a real-valued, stationary process X = 
(-Xi)tgz are available. The null hypothesis is that X is a Markov process of order p with 
a particular form of the conditional distribution, that is, Px belongs to Mo C\M. Recall 
that G:I P x 6 x 1^1 is some known function depending on an unknown parameter 
vector and (st)tez is a sequence of i.i.d. innovations with distribution F £ . Since our 
test statistic below requires estimates of and F s , we must be more specific about the 
models to be considered and it turns out that asymptotic properties must be derived in 
a case-by-case manner. We will focus our attention on the following classes of processes. 

2.1. AR(p) processes 

Here, we suppose that, under Hq, the following condition is satisfied. 

(Al) The process X = (Xt)tez obeys the model equation 

X t = 0'X t _i+e t , (2.1) 

where (et)tez is a sequence of i.i.d. innovations with Ee t = 0, < Eef = a 2 , 
Vej < oo and = (0i,0 2 , . . . , P )' e 6 and 

6 = {9 G W : 1 - 0i z 9 p z p ^ for all z e C with \z\ < 1}. 

In this case, the function G is given as G(X t _i,0,e t ) = 0'X t _i + e t , while P(X t < y \ 
X*_i) - F £ MX t -i, y, 9)) with w(X t _i, y, 9) = y- 0'X*_i. 

It is well known that (2.1) has, under (Al), a unique stationary solution which has a 
representation as a causal linear (MA(oo)-) process, that is, X t = Y^hLo a k £ t-k, where 
| Q!fc | < Kp k for some K < oo, p < 1; see, for example, Brockwell and Davis (1991), 
page 85. Adapting the proof of Lemma 9 in Doukhan and Louhichi (1999), it can be 
shown that (Al) implies, for si < • ■ ■ < s u < t\ < • • ■ < t v and arbitrary measurable func- 
tions g:R u — >R, h:W — >R with F,g 2 (X sl , . . . , X Su ) < oo, F,h 2 (X tl , . . . ,X tv ) < oo, 

\cov(g(X 3l ,...,X Su ),h(X tl ,...,X tv ))\ 
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(2.2) 

OO 

< y/Eg(X Sl X Su ) 2 Up(h)y/E4 \ak\{{k-(h-s u ) + l)Av]. 

k—ti — s u 

Furthermore, it can also be shown that, for s% < • • • < s u < t± < t^, 1 < k±, k% < p, the 
following inequalities hold true: for any measurable g : R" — ► K with Eg 2 (X Sl , X Su ) < 

X, 

| cov{g{X ai , . ..,X 8u ),X tl _ kl e tl )\ < Cy/Eg*(X Bl ,. . . , X s > tl _ fcl _ 8 „ (2.3) 
and, for any measurable g:M. u — >M with j | (? 1 1 oo = sup,j, gRu l.9( :c )l < °°: 

\cov(g(X Sl ,. ..,X Su ),X tl - kl X t2 -k 2 et^t 2 )\ < C||p|| (x ,p mill {t 1 _ fel _ Sttit2 _ fe2 _ Su }, (2.4) 

where p r = •>J'Y^k=r \ a k\i that is, a weak dependence condition similar to those in 
Doukhan and Louhichi (1999) is fulfilled. 

2.2. ARCH(p) processes 

The class of autoregressive, conditionally heteroscedastic (ARCH) processes was intro- 
duced by Engle (1982). In this case, our null hypothesis means that the process fulfills 
the following condition. 

(Al') The process X = (X t ) te z is stationary and obeys the model equation 

X t = y^o + 1 X?_ 1 + --- + 6 p X?_ p £ tl (2.5) 

where G = (6q,Gi,...,6 p )' £ O and 9 = {6 e R p+1 : G > 0, Gi > 0, i = 1, . . . ,p and 
Gi < 1}. Furthermore, (et)tez is a sequence of i.i.d. innovations with Ee t = 0, 
Ee 2 = 1 and Eef < oo. 

In this case, the function G is given as G(Xt_i, 0, £t) = ^J@o + 0\X 2 _ X ^ i~ 8pX 2 - p £ t- 

Moreover, we have that P(X t < y \ X t -i) = F e (w(X t -i,y,6)) with w(X t ,y,6) = 

y/\J®o + 9~iX%~y + h 6 p X 2 _ p . Milh0j (1985) obtained a representation of the unique 

stationary solution to (2.5) as X t ~ et ■ (Oo ■ X)/£Lo-^(*>^)) ' wnere M(t,0) = 1, 
M(t,k) =JZ u -,a k =iHi=i <H £ t-a 1 <h- (Obviously, in the second display af- 

ter equation (2.1) in Milh0j (1985) should read rii=r) The process (X t ) teZ is both 
weakly and strictly stationary. To deal with this process, we will exploit the following 
weak dependence property. 

Lemma 2.1. Suppose that (Al') is fulfilled. There then exist some p < 1, C < oo, such 
that, for all si < ■ ■ ■ < s u < ti < ■ ■ ■ < t v and arbitrary measurable functions g : M. u ► K, 
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with Eg 2 (X Sl , . . . ,X Su ) < oo and Eh 2 (X tl , . . . ,X tv ) < oo, 
\cov(g(X sl ,...,X Su ),h{X tl ,...,X tv ))\ 



< CVE 3 2 (X sl ,...,X Su )Lip(/iy 



(2.6) 



Furthermore, for Si < • • • < s u < t\ < t 2 , 1 < k\, k 2 , &3, ki < p, the following inequalities 
hold true: for any measurable g : K" — > K with Eg 2 (X Sl , . . . , X Su ) < oo, 

\cov(g(X sl ,. . .,X s J,Xl_ ki Xl_ k2 (s 2 tl - 1))| 

, (2-7) 

< C^Eg 2 {X ai ,...,X Su )p^- s - 

and, for any measurable g'.W 1 — >I with \\g\\oo = sup^ggu | (/(a;) | < oo, 

| cov(g(X Sl ,. ..,X s J,Xl_ kl Xl_ k2 Xl_ k3 Xl_ ki (e 2 tl - 1)(4 - 1))| 

(2.8) 

^CILgllooP* 1 "^. 



2.3. Markov processes driven by diffusions 

At this point, we would like to explicitly mention two particular classes of processes 
which are of interest in financial mathematics. 



2.3.1. Vasicek model 

Merton (1971) proposed to model interest rate processes by an Ornstein-Uhlenbeck pro- 
cess (in continuous time), 

dX t = 1 (0 2 -X t )dt + 6 3 dW t , (2.9) 

where 0i,O 2 ,8 3 > and {W t )t& is a standard Wiener process. This model was fur- 
ther considered by Vasicek (1977). Assume that we observe this process at equidistant 
design points At, where t = 0, 1, . . . , n. These observations then form a Markov pro- 
cess with stationary and conditionally normal distributed increments where, for s < t, 
E{X t | X s ) = 9 2 + (X s - e 2 )c- e ^-^ and var(A t | X s ) = 0f (1 - exp{-20!(f - s)})/{29 1 ). 
Hence, introducing appropriate innovations St, we can rewrite {X At )t& in the form 
X A t = G(X A{t _ 1} ,0,e t ), where G(X £k{t -.i ) ,6,E t ) = [0 2 + (^A(t-i) - d 2 )e-^ A ]+e t and 
(£t)tez is a sequence of independent normally distributed variables with zero mean 
and variance 0|(1 — exp{— 26\{t — s)})/(2#x)- Accordingly, the process (Y t ) te z with 
Y t = X\ t — #2 is an AR(1) process with parameter e 9lA which satisfies conditions anal- 
ogous to (2.2) to (2.4) above. 
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2.3.2. Cox-Ingersoll-Ross model 

Also for the purpose of modelling interest rates. Cox, Ingcrsoll and Ross (1985) proposed 
the specification 

dXt = 0i(62-X t )dt + OsVXtdWt, (2.10) 

where 01,62,63 > and (Wt)t>o is a standard Wiener process. Again, the values of the 
process at equidistant time points form a stationary Markovian process. An explicit 
description of the conditional distribution of X\ t given X^u_i\ can be found in Cox et 
al. (1985), page 391. In the special case where q = 40i02/0§ is an integer, it follows that the 
conditional distribution of Y t = cX A t [c = 40i/(0 3 (l - e~ 9lA ))] given Yt—i is noncentral 
chi-square with q degrees of freedom and parameter of noncentrality Yt-ie~ 6lA . 

Let G(X,x) = •F^a( exp {_e 1 A}A)( a; )- With an appropriate sequence (et)tez of 
independent uniform(0, l)-distributcd innovations, we can write (Yt)tez as Yt = 
G(Y t -i,6,e t ), 0= (01,02,0s)'- From E|G(O,0,e t )| < 00 and E\G(u,6,e t ) - G(v,6,e t )\ = 
\EG(u, 0, £{) — EG(v,6, £t)| < exp{— 0iA}|m — v\, it follows, analogously to Lemma 10 of 
Doukhan and Louhichi (1999), that conditions such as (2.2) to (2.4) are fulfilled. 



3. The test statistics and its limit distribution 

Consider testing the hypothesis H of interest. Since P(X t < y | X t _i = x) cannot be con- 
sistently estimated in general, we construct our test statistic from cumulative versions 
of the hypothetical transition probabilities and model-free estimators thereof. As men- 
tioned in the Introduction, the basic idea needed to construct an appropriate deviation 
process is to consider the difference between a fully nonparametric version of the one-step 
transition distribution function and its parametric version postulated under Hq, that is, 
to consider the basic deviation process Un\x,y) given in (1.1). To specify Fn (y | X ( _i) 
given in this equation, that is, to specify the one-step transition distribution function un- 
der the null hypothesis, we proceed as follows. Since the null hypothesis is only partially 
specified, that is, and F e are unknown, we replace these unknown quantities by their 
corresponding sample estimates. In particular, and in order to deal with the uncertainty 
introduced by the fact that is unknown, wc assume the following 

(A2) The sequence of estimators admits the expansion 

6-6=-y2l(X t _ 1 ,X t ;6)+o P (n~ 1 / 2 ), 
t=i 

where / (•,•;■) is a measurable function from RPxlxBto R k with E e l(X t -i,X t ;6) = 
(0,...,0)' and E e ||Z(X t _i,X t ;0)|| 2 <oo. 

Note that in the AR(p) case, assumption (A2) is satisfied with k =p, l(X t -i,X t ;6) = 
r~ 1 Xt_iet and r p = r p (0) = Ee(Xt_iX(_ 1 ) if is the commonly used least-squares or 
Yule-Walker estimator (cf. Brockwell and Davis (1991)). For the linear ARCH(p) case, 
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(A2) is, for instance, satisfied if 9 is the least-squares estimator of 9 = (9 , 6i, ■ ■ ■ , 9 p )' . 
In this case, k = p + 1, Z(X t _i, X t ; 9) = C^Yt-iY^Ce? - 1), where C p = C p {9) = 
Ee(Y t _iYj_ 1 ) and Y t _! = (l,*^, . . . ,X?_ p )'. 

To deal with the fact that F e is unknown, we replace F e by the empirical distribution 
function of estimated residuals e t , that is, 

i " 

i f, e(y) = -^/(£t<y), (3.1) 

t=l 

where £< = w(X tl X tl 9). Note that the estimator used is assumed to satisfy (A2). Note, 
further, that instead of F s , we can also use an estimator of F e based on centered and/or 
standardized residuals like those used in the bootstrap schemes discussed in Section 4. 
This, however, will result in an extra term in the asymptotic covariances of the finite- 
dimensional distributions of the process U n given below. 

The above considerations and the resulting specification of J^ (-|Xt_i) in (1.1) lead 
to the basic deviation process 

1 " ~ 
U n (x, y) = -= V J(X t _i < x)[I(X t <y) — F e («;(X t _i,y, (?))], (3.2) 

which is used in the sequel for testing the null hypothesis of interest. A basis for a test 
of Hq is now given by the suprcmum deviation, 

S n = sup \U n (x,y)\. (3.3) 

Notice that, for any x\, . . . , x k -i, x k +i, ■ ■ • , x p , y (1 < k <p), U n (xi, . . . ,x k -i, ■ ■ ■ ,x k +i, 
...,x p ,y) is piecewise constant with possible jumps at X\_ k , . . . ,X n - k . Furthermore, 
U n {x,-) has possible jumps at X\, . . . ,X n and is monotonously nonincreasing between 
these jumps. Hence, it follows that 

S n = max \U n (x,y)\, 

(,x,y)£Xo 

where X n = {-oo,X , . . . ,X n -i} x ••• x {-co, Xi- P , . . . , X„_ p } x {-oo,X 1 - 0, 
Xi, . . . ,X n — 0,A„,co}; that is, it suffices to compute the test statistic by evaluating 
U n on the grid X n . 

To derive the asymptotic distribution of S n , we first determine the limit distribution 
of the processes U n . Let D = D(M. P+1 ) be the space of cadlag functions on the extended 
(p + l)-dimcnsional Euclidean space R p+1 , that is, of functions which arc continuous from 
above and possess limits from below. It is clear that U n belongs to D with probability 1. 
Since we deal with suprema of these processes, it is convenient to endow D with the 
suprcmum norm || ■ || and to prove weak convergence of the distributions in the normcd 
space (D, || • ||). In accordance with the discussion in Section IV. 1 in Pollard (1984), we 
do not endow the space D with the Borel a-field generated by the closed sets under the 
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uniform metric since this cr-field would be too rich, consequently creating measurability 
problems. Rather, we use the projection cr-field V generated by the coordinate projection 
maps. Since S n can be written as sup^ . y ) S Qp X Q \U n (x,y)\, it is clear that it is (V — im- 
measurable, where B is the Borcl cr-field. The fact that D is not separable does not 
matter in the following since the limit process U is concentrated on a separable subset 
of continuous functions. Consequently, we can apply the continuous mapping theorem 
(Theorem V.l in Pollard (1984)) to derive the limit distribution of the test statistics. 
The following theorem establishes the asymptotic behavior of S n if the null hypothesis 
is true. 

Theorem 3.1. Suppose that Hq is true with X t = G(X t _i, 9, £t) in Mq satisfying (Al) 
or (Al'). Assume, further, that (A2) is fulfilled. Ifn^oo, then 

where U is a Gaussian process with continuous sample paths, zero mean and covariance 
function 

^((x 1 ,y 1 ),(x 2 ,y 2 )) 

= ^2cov(g 1 (X ,X ;x 1 ,y 1 ) + g 2 (Xo,X ;x 1 ,y 1 ) + g 3 (X Q ,X ;x ll y 1 ), 

g 1 (X t -i,X t ;x 2 ,y 2 ) + £f2(X t _i, X t ; x 2 , 2/2) + 9s(X-t-i, X t ; x 2 , y 2 )) 

with 

gi (X t _ u X t ; x, y) = J(X t _i ■< x)[I(X t <y)- F e (w(X t - U y, &))}, 
g 2 (X t _ 1> X t ;x ) y) = l(X t . 1 ,X t ;ey f F £ (w(z,y,9))P x ^ (dz) 

J{z^x} 

and 

g 3 (X t _ 1 ,X t ;x,y) = - [ [I(e t < w(z,y,6)) - P(e t < w(z,y 7 9))]P x ^(dz). 

Theorem 3.1 and an application of the continuous mapping theorem together yield 
that an asymptotically a-level test for testing Hq is given by the following rule: reject 
Hq if S n > £i_ Qi00 , where £i_q i00 denotes the (1 — a)-quantile point of the distribution 
of sup( x y - jeRPxR \U(x,y)\. A bootstrap approach to calculating these quantiles is given 
in the next section. 

We conclude this section by mentioning that testing more specific null hypotheses 
concerning the transition kernel of the underlying Markov process is also possible using 
our approach. For this, appropriate specifications of the basic deviation process (1.1) 
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can be used, depending on the specifications imposed of Fh (- | X t _i). Clearly, the most 
simple case is that of a fully specified Markov model, that is, that of testing 

: Fx e {Fx : X t |X t _i ~ F (- | X t _i)} n M. 

The obvious specification of Un (x,y) which is appropriate in this case is 

1 " 

uW{x,y) = -= V J(Xt_i d ^)[/(X t <y)- F (y | X 4 _ x )]. (3.4) 

Another example, which is of more interest in applications, is where the one-step tran- 
sition distribution function under the null depends on an unknown vector of parameters 
9, but the distribution function of the innovations is specified, that is, where 

H { a 2) :Pxe{Px:Xt = G(Xt-i,0,et),e t ~F e i.i.d.; 9 e 6} n M. 

In this case and instead of (3.2), the specification 

1 " 

should be used. It can be shown, along the same lines as in the proof of Theorem 3.1, 
that Un — — > f/'- 2 -', where [/( 2 ) is a zero-mean Gaussian process, the covariance function 
of which is that obtained from r((xi, yi), (x2, 2/2)) given in Theorem 3.1 after ignoring 
the component (73. 



4. Bootstrap approximations 
4.1. The bootstrap procedure 

To approximate the distribution of S n under the null hypotheses, we use a model-based 
bootstrap approach which employs the particular structure of the generating equation 
X t = G(Xt_i, #,£(). In this context, the unknown parameter 9 is replaced by its esti- 
mator 9. Furthermore, the innovations e t are replaced by pseudo-innovations generated 
according to the empirical distribution function of estimated errors. In particular, the 
pseudo-innovations are generated using the empirical distribution function 

n 

F e (y) = -Y^m<y), (4-1) 
n 4 

t=i 

where, for instance, in the AR(p)-case, is given by St = e t — Ti~ X^=i and, in the 

ARCH(p)-case, by e t = £t/\J n^ 1 J2"=i £ s with e t = e t — rT 1 X^"=i where e s is defined 
in the sentence following equation (3.1). 
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The bootstrap algorithm used to approximate the distribution of S n under the null 
hypothesis is described by the following three steps. 

1. Let Xq = (Xq , J* j , . . . , be some starting values. Given X*_ x = {X^_ 1 ,X*_ 2 , 
■■■,X;_ p ), generate X; by 

where e* t are i.i.d. random variables with e* t ~ F £ , where F £ is defined in (4.1). 

2. Based on the bootstrap pseudo-series (X*)t=i- P ,2- P ,...,n, let U*{x,y) be defined as 
U n {x,y) and obtained by replacing 9 and F e in U n {x,y) by 9* and F* , respectively. 
Here, 9* denotes the same estimator as 9 based on (X t *)t = i_ Pj 2-p n- Furthermore, 

1 " 

F:{y) = -Y J I in<v), (4-2) 

t=l 

where e£ = w{%^ , X t * , 0*) . 

The bootstrap analogue of S n is now given by 

5*= sup |C7*(x,y)|. 

3. Reject -ffo if 

where t*_ Q oo denotes the (1 — a)-quantile of the distribution of S* . 

4.2. Some basic properties of the bootstrap processes 

The derivation of theoretical results for the bootstrap is based on a case-by-case inves- 
tigation since to establish properties such as stationarity and weak dependence of the 
bootstrap process, the particular model structure generating the X£ 's is explicitly used. 
Note that the bootstrap counterpart to (X t )t£Z is the stationary (if it exists) process 
(X t *) tg z obeying the equations 

X^GiXUAe;). (4.3) 

We will see below that, in the AR(p) and ARCH(p) cases considered here, a unique 
solution to (4.3) exists with a probability tending to 1. 

We first deal with the properties of the proposed bootstrap procedure to approximate 
the hypothesized conditional distributions p-^'l^- 1 under the null hypothesis. The fol- 
lowing lemma shows that the conditional distributions of the bootstrap process converge 
to the conditional distributions of the original process under the null and to some le- 
gitimate conditional distribution under the alternative. This convergence takes place in 
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probability. To formulate such results in a transparent way, we define the following metric 
between distributions on (M. d ,B d ): 

d(P,Q)= inf E[||X-y|| Al], 

X~P,Y~Q 

where the infimum is taken over all pairs (X,Y) with X ~ P and Y ~ Q, and where 
|| • || is any norm on R d . A metric similar to this (the Mallows metric) has been used 
by Bickel and Freedman (1981), also in the context of proving bootstrap consistency. 
Convergence in the above metric is, in particular, equivalent to weak convergence. Con- 
cerning the behavior of the estimator 9 in the case that the null hypothesis is not true, 
we make the following assumption. 

(Bl) There exists 9 £ such that 9 — ► 9 in probability. 

Lemma 4.1. Suppose that (Al) or (Al') and (Bl) are fulfilled. Assume, further, that 
the density f e = F' e of e t is bounded, where e t —w(lLt-i,Xt,9). Then, for every compact 
set K C W, 

sup (i (p^|xr_ 1 =x ) px t |x t _ 1 =x ) Q 

According to the above lemma, the asymptotic limit of P x± ' x '- 1 depends on whether 
the underlying hypothesis is true or not. In particular, P Xt ' x *-! converges to that of 
the Markov process generated by X t = G(X t _i, 9, et), where the innovation sequence has 
distribution F e . If Hq is true, then 9 = 9 and F e = F £ . Note, however, that Px* £ -Mo, 
where Px* denotes the law of X* = {X£,t G Z}, even if the null hypothesis is not true. 
This is important for a good power behavior of the bootstrap-based test. 

The following lemma is the key step in proving consistency properties of Markov boot- 
strap. It basically states that convergence of the conditional distributions implies con- 
vergence of the stationary distributions. 

Lemma 4.2. Suppose that (Yt)tgz and (Y^ )t^z, n€N, are stationary Markov pro- 
cesses of order p, defined on probability spaces (Q,A, P) and {Q, n ,A n ,Pn), respectively. 
Suppose that 

(i) for all compact sets K C W , 

sup rf(P^ ( " >|F ^ =yi '---' y '-= yp ,P^I^-i^i^-.^-P=^) 0; 



yen 

(ii) for allyeW, 



Sup ^pY t \Yt- 1 =yi.....,Yt-p=y P ^ pY t \Yt- 1 =yi,...,Y t - p =y p ^ > q. 

~n~ 1 1 - je <5^0 

y-\\y-y\\<o 

y-Cn) 

(hi) (P„* )„ eN is tight; 
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(iv) there is a unique stationary distribution P y i>—> y p that corresponds to P Y t\ Y t-ir--, Y t- 
Then, for all keN, 

Pf 1 •-'* =^P^'-.^. (4.4) 

By Lemmas 4.1 and 4.2. wc obtain the following result which shows the convergence 
of the finite-dimensional distributions of the bootstrap process to the desired joint dis- 
tributions under the corresponding null hypothesis. 

Corollary 4.1. Suppose that the assumptions of Lemma 4.1 are satisfied. Then, for all 
keN, 

d (px* t ,...,xt + ^ p x t ,...,x t+k ^_p^^ 

X X 

where P- t+k denotes the stationary probability measure of the Markov process 

(-Xt)tez generated by X t = G(X t _i, 8, e*) and where the i.i.d. innovation sequence sat- 
isfies e t ^F e . 

Our next result deals with the weak dependence properties of the bootstrap processes. 

Lemma 4.3. Suppose that the assumptions of Lemma 4.1 are satisfied. There then exist 
sets f2 n C such that P((Xi— p , . . . , X n ) £ Q„) — > 1 and, for any sequence (io n ) ne ^ 

n — >oo 

with Lu n € Q n , (X t *) t6 z satisfies (conditionally under (Xi_ p , . . . , X n )' = LJ n ) conditions of 
weak dependence analogous to (2.2)-(2.4) and (2.6)-(2.8), respectively, with coefficients 
of weak dependence that can be majorized by a geometrically decaying series. 

4.3. Bootstrap validity- 
Based on the basic properties of the bootstrap procedure stated in the previous section, 
we are now able to justify asymptotically its use in obtaining critical values of the test 
statistics S n . As in (A2), we assume the following. 

(B2) The sequence of estimators 9* admits the expansion 

~ 1 " 
t=l 

where /(•;•) satisfies E*/(X*_ l7 X*- 6) = and E*||Z(X^_ 1 ,X t *;?)|| 2 = O p (1). 

The next theorem establishes the asymptotic limit of U* which is used to evaluate the 
distribution of the test statistic S n under the null hypothesis. 
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Theorem 4.1. Assume that (Al) or (Al') as well as (Bl) and (B2) are fulfilled. Sup- 
pose, further, that f e = F' e is continuous and f e (z) — > 0, where F e denotes the dis- 

z — >±oo 

tribution function of et = w(X t -i, X t ,9). There then exist sets £l n C R p+n such that 
P((Xi-p, . . . ,X n )' € fi„) — ► 1 and, for every sequence (oj n )neN with u) n £ £l n , we have 

n — >oo 

(the bootstrap distribution is taken conditionally under (Xx— p , . . . ,X„)' =u>„) 

U* Jl^jj 

^ n ^ 

as n— > oo ; where U is a Gaussian process with continuous sample paths, zero mean and 
covariance function 

r((xi,j/i), (0:2,2/2)) 

= ^r f cov(jj 1 (Ko,X ;xi,y 1 ) + g 2 (X ,X ;x 1 ,yi) + g 3 (X 0) X ;x 1 ,y 1 ) ) 

tGZ 

g^Xt-i, X t ;x 2 ,y2) + g 2 (X t -i,Xf,x 2 ,y 2 ) + g 3 (X t -i, X t ;x 2 ,y 2 )), 



and gi(Xt-i,Xt;x,y) is defined for i = 1,2,3 as gi(Xt-i,Xt;x,y) in Theorem 3.1, with 
9 replaced by 9 and F £ by F e . 

Note that if Ho is true, then U = U since 9 = 9 and F e = F e . In this case, the limiting 
behavior of the bootstrap statistic S* is identical to that of the statistics S n given in 
Theorem 3.1. On the other hand, if Hq is not true, then t*_ a ^ — > c as n — > 00, where c 
denotes the (1 — a)-quantile point of the limiting distribution of sup^ 9 ) gRPxR \U(x, y)\. 
In this case, n -1 / 2 ^ — ► C in probability, where C denotes a positive constant. Therefore, 



lim P(S n >tl_ ) 



a, if Hq is true, 
1, if Hi is true, 



that is, the test based on the bootstrap critical values t*^ a x asymptotically achieves 
the desired level a and is consistent. 

We conjecture that our test has nontrivial power for local alternatives converging to 
the null at a y^n-ratc. To illustrate this, consider the simple case of a fully specified null 
hypothesis H^ and sequences of local alternatives corresponding to Markov processes 
having one-step transition distribution functions given by 

F n (-\X t . 1 ) = F (-\X t . 1 ) + ^H(;X t _ 1 ), 

v n 

where H() is an appropriate function satisfying 

n 

- V 7(X t _! ■< x)H(y, X t _r) -> D(y, x) # (4.5) 
n 

t=i 
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in probability as n — + oo. In this case, the corresponding deviation process Un (y,x) can 
be decomposed as 

1 " 

n 

+ -y2KX t -ilx)H(y,X t -i), 
n z — ' 
t=i 

from which (taking into account (4.5)) the desired result will follow by showing that 

1 " 

UP(y,x) = J(X,_i X i)[/(X t < y) - F n (y\X t ^)] 

converges to the same Gaussian process as the process (3.4). 



5. Numerical examples 

(2) (2) 

Example 1. The test statistic Sn = mas( Ii!( )g^n \U„ (x, y)\ is applied to test the hy- 
pothesis that the underlying process obeys the ARCH(l) structure 

X t = 8q + Q\Xl_ x et 

with independent and standard Gaussian-distributed errors. Three different sample sizes, 
n = 100, 200 and 400, have been considered. The parameters of the process have been 
set equal to 6 = 0.1 and Q\ = 0.4. The results obtained are based on the least-squares 
estimator of 8q and 9\. Table 1 presents the empirical rejection probabilities. To inves- 
tigate the power of our test procedure, different types of alternatives have been con- 
sidered. One alternative to the hypothesis of an ARCH(l) process with Gaussian er- 
rors is where the distribution of the innovations is given by St = r\t \fiy — 2)/V, where 

Table 1. Empirical rejection probabilities for testing the hypothesis of an ARCH(l) model 



HP TRUE: H n (2) FALSE: 







Q 


ARCH(l) 
e t ~Af(0,l) 


ARCH(l) 

st ~ U 


ARCH(2) 
02 = 0.4 


GARCH(1, 1) 


SV-Model 


n = 


100 


0.05 


0.064 


0.202 


0.190 


0.151 


0.460 






0.10 


0.109 


0.310 


0.278 


0.235 


0.660 


n = 


200 


0.05 


0.041 


0.281 


0.295 


0.225 


0.775 






0.10 


0.115 


0.415 


0.402 


0.345 


0.871 


n = 


400 


0.05 


0.046 


0.485 


0.515 


0.366 


0.980 






0.10 


0.112 


0.671 


0.635 


0.495 


0.991 
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(Vt)tez is an i-i-d- sequence of i-distributed random variables with v degrees of free- 
dom. We also investigated the power of our test for the case where the true process 
is an ARCH(2) process with additional parameter 62 = 0.4, a GARCH(1, 1) process 
X t = (J t e t with a\ = 0.08 + 0.7Xf_ 1 + 0.2af_ 1 and the simple stochastic volatility (SV) 
model X t = exp{/i t /2}e t and h t = —0.9 + 0.6h t -i + cot, where u>t is a sequence of inde- 
pendent standard Gaussian random variables. 

Example 2. Using the test statistic S n , the hypothesis of interest is that the underlying 
process is an i.i.d. process with standard Gaussian- or uniform-distributed innovations. 
The alternative considered to this null hypothesis is that the underlying process is a 
first order autoregressive process X t = 0X t -\ + e t with e t ~ Af{0, 1) and three different 
values of 9. Table 2 presents the empirical rejection probabilities of S n for testing the 
corresponding null hypothesis based on sample sizes of length n = 25, n = 50 and n = 100. 

The results presented in Tables 1 and 2 are based on 200 replications of the under- 
lying process, where, for each replication, critical values of the test have been obtained 
using 500 bootstrap samples. Although computational requirements prevented us from 
considering larger sample sizes, more trials, more bootstrap replications or more com- 
plicated models, the results obtained are very encouraging. In particular, and as these 
tables show, the test statistic retains the desired size under the null hypothesis and shows 
a very good power behavior for the different types of alternatives considered. 

A real-data example. We apply our testing procedure to the first n = 2000 observa- 
tions of the monthly log-returns of the Intel stock series analyzed in Tsay (2005) . Tsay 
(2005), page 109, selected, for this series, the ARCH(l) model 

r t = 0.0174 + X t , X t = ^0.0134 + 0.2492X t 2 _ l£t , 

with standard Gaussian-distributed innovations et- For this model, the value of the test 
statistic Sn =xaaX( Xiy -\ e x n \Un (x,y)\ equals 15.025, while a bootstrap estimate of the 
upper 5% percentage point of the distribution of the same statistic under the null equals 



Table 2. Empirical rejection probabilities for testing the hypothesis of an i.i.d. sequence 









Hjf ] TRUE: 
£t ~AA(0,l) 


H ( 3) FALSE: 
et~U(-^3,v / 3) 


= 0.2 


= 0.4 


= 0.6 


n = 


25 


0.05 


0.075 


0.081 


0.105 


0.330 


0.625 






0.10 


0.152 


0.161 


0.268 


0.485 


0.740 


n = 


50 


0.05 


0.041 


0.058 


0.170 


0.585 


0.889 






0.10 


0.104 


0.105 


0.270 


0.709 


0.925 


n = 


100 


0.05 


0.062 


0.069 


0.305 


0.834 


0.995 






0.10 


0.115 


0.118 


0.455 


0.925 


0.998 
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1.501. This percentage point has been estimated using B = 1000 bootstrap replications. 
Our testing procedure therefore leads to a rejection of the above ARCH(l) model for 
the Intel stock scries. Note that standard methods, based on residuals, for checking the 
fit of the above model do not indicate any inadequacy of the fitted ARCH(l) model in 
describing the conditional heteroscedasticity of the data; see Tsay (2005), page 111, for 
details. 

6. A central limit theorem 

The first central limit theorems for weakly dependent sequences were given by Corollary A 
in Doukhan and Louhichi (1999) and Theorem 1 in Coulon-Prieur and Doukhan (2000). 
While the former result is for sequences of stationary random variables, the latter one 
is tailor-made for triangular arrays of asymptotically sparse random variables as they 
appear with kernel density estimators. Below, we state a central limit theorem for general 
triangular schemes of weakly dependent random variables. An interesting aspect of this 
result is that no moment condition beyond Lindebcrg's is required. 

Theorem 6.1. Suppose that (X n k)k=i n> n£N, is a triangular scheme of (row-wise) 
stationary random variables with "EiX n k = and EA^ k < C < oo. Furthermore, we as- 
sume that 

1 " 

-]Tea2 J(|X„. fe |/^>e) — > (6.1) 
n — * n — >oo 

fc=i 

holds for all e > and that 

v&r(X n>1 + --- + X n>n )/n — ► <r 2 e [0,oo). (6.2) 

n — >oc 

For n > uq, there exists a monotonously nonincreasing and summable seguence (0 r ) r £f$ 
such that, for all indices 1 < Si < s 2 < • ■ ■ < s u < s u + r = t\ < i 2 < n, the following upper 
bounds for covariances hold true: for all measurable and quadratic integrable functions 
f:« u >R, 

| cov(/(X„ >Sl , . . . , X n , s J, X nM )| < yjEf 2 (X n , Sl ,...,X n , Su )6 r (6.3) 

and for all measurable and bounded functions f : K" > K., 

| cov(/(X„, Sl , . . . , X„, S J, X„ )tl X n , t2 )| < H/IU^r, (6.4) 
where ||/||oo = Sup xeE „ Then, 

-L(X nA + --- + X n , n )^Af(0,a 2 ). 
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Proof. If cr 2 = 0, then wc obviously have that ^(X„,H \-X„. n ) -^U jV(0, 0). There- 
fore, it remains to prove the assertion in the case a 2 > 0, which we assume in the rest of 

the proof. Let Y n ^ — X n ,k/ y/E(X n ,i + h X ntn ) 2 . In view of (6.2), it suffices to show 

that 

Y nA + --- + Y n , n ^Af(0,l)- (6.5) 

To prove this, we use the classical Lindeberg method which was first adapted to causal 
CLT's by Rio (1995). 

We set al = var(X„ 4 H h X„ >n ) = nvar(X n ,i) + 2X)"=i(" - j)cov(X n s,X nj+1 ) 

and v n , k =var(y„ 4 H hFn^J-va^H hl^fc-i) = (var(X„a) + 2^=i 1 cov (^n,i 5 

^nj+i))/^- Wc obtain, by | cov(X n>1 , X ntj+1 )\ <C6j, that 

|n« n ,fc - 1| < — [J2 3°i + E( n - 3)0 i ■ 
n Vj=i 3 =k / 

Since, by majorized convergence, (j/ n )^j — * an d 53^=*- ^? — y 0, it follows that 

there exist fco , n £ N such that 

^n,fc — for all (n, fc) with n > no and fco < fc < n. (6-6) 

To simplify the notation in the rest of the proof, we pretend that (6.6) holds for all 
(n, k) with n > 1 and 1 < k < n. (Otherwise, we start with no and sum the first fco — 1 
random variables in each row to a new random variable, Y n _Q = Y n _\ + • • • + Y nt k -i- We 
then prove the assertion for the sums Y n _o + Y n ^ a + • • • + Y n>n .) 

Let h : R — > R be an arbitrary, three times continuously diffcrentiable function 
with \\h^ || oo =: Cj < oo, j = 0, . . . , 3. Furthermore, let Z„_k ~ A/"(0, v, h k), k = 1, . . . , n, 
be independent random variables which are also independent of (3^ Jl fe)fe=x,...,n- Since 

v n< \ + h v n , H = 1, it follows from Theorem 7.1 in Billingsley (1968) that it suffices to 

show that 

Eh(Y nA + ■■■ + Y n<n ) - Eh(Z nil + ■■■ + Z n , n ) — » 0. (6.7) 
We define S n ,k = Z)j=i ^nj and T„, fe = £)? =fc+1 Then, 

E/i(F n ,i + ■ ■ ■ + F n ,„) - Eh(Z nA + ■■■ + Z njl ) = J2 A n.k, 

fc=l 

where 

A„^ = E[/i(SVi,fc + in.fc + Tn.fc) — h(S n> k + Z n ,k + T n ^)\. 
We further decompose A„ t k = A^ 1 ]. — A^, where 

A S - E/i(5 n , fc + F„. fc +T n , fe ) - Eh(S n>k +T ntk ) - ^EhW(S n , k +T ntk ), 
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A ni = Vh(Sn,k + Z n , k + T n . k ) - Eh{S n , k + T„, fc ) - ^Eh& (S n , k + T nM 



We will show that 



E A S 

fe=l 



fori = 1,2. 



(i) Upper bound for |ELi A Sj- 

Since EZ n ^h! {S n ,k + 2n,fe) = 0, we have, for some random p n ^ G (0, 1), that 
Z 2 



Hence, we obtain that 

n 

£ A S 



fe=i 



„ n 



k=l 



< ^E|AA(0,1)| 3 max {^^} — > 0. 



(ii) Upper bound for | £Li A i fcl- 
Let e > be arbitrary. We will actually show that 



k=l 



< e for all n > n(e) 



We have, for some random r nyk G (0, 1), that 



A n fc = E ^^'(5„, t + T„ jfc ) + E 



y'2 



h {2 \S nM +T nM Y nM +T n , k ) 



^LEhW(S n , k + T n 



Since 'EY n ^h' \T n ^) = 0, we have, again for some random fJ, n ,k,j G (0, 1), that 
EF„,fc^'(5'?i,fc + In,*.) = ^2 E^ / n,fe[^'(5'rij-+i + 7?i,fc) — h'(S ni j + T 7ly k)] 

3=1 
fc-1 



(6.8) 



(6.9) 
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This yields, in conjunction with 



k-l 



v n . k = EF„ 2 fe + 2 J2 EY ntk Y niit 



that 



fc-d 



A i fc = E EY ™,kY n ,j [h {2) {S n ,j + l>, ,.!...)],.. + T„,fc) - Eh^ (SnM + T n ,fc)] 



fc-1 



+ ^ EK^Y^ ^ 2 ' + lin.kjYnj + T n . k ) - Eh™ (S n , k + T n , k )\ 
j=k—d+l 

+ \-EYl k [h^ 2 \S nM + r n , k Y n . k + T n>k ) - EhW(S n , k + T n>k )] 

_ A (l,l) , A (l,2) +A (1,3) 
— ^n,k ^ ^n,k ^ ^n,k > 

say. (The value of d does not depend on n and its proper choice is indicated below.) 
We now have, by (6.3), that 

i < E (^) ^ = o E °?J ■ 

By choosing d sufficiently large, we obtain that 



fc=i 



< - for all n > n(e). 
- 3 - w 



(6.10) 



The term A^'j 2 will be split up as 



k-l 



n,k,jY n j ~\~T n ^k} h (Sfi^—d ~l~^n,i 



k-l 



+ J2 Ey„, fc r„ J [^ 2 H^„ J -d + ^ fe )-E^ 2 )(5„ J _ d + T„ ifc )] 
j =k -d+i 



k-l 



+ 5^ Ey„, fc y; lJ [E/ l ( 2 )(5„ J _ d + r ri , fe )-E/ l ( 2 )(^ l , fc + T ri , fc )] 
j =k -d+i 

A (1.2,1) , A (1,2,2) A (1,2,3) 
^n,fc ^n,fc ^\i,fc ' 



34 M.H. Neumann and E. Paparoditis 

say. The Lindeberg condition (6.1) yields that, for arbitrary e' > 0, 



E A 

fe=i 



(1,2,1) 
n.k 



<2C 2 



\ fe=i 



n A:— 1 



\E E E ^'< 

\ fc=l j=fc-d+l 



n fc-1 



X E E E ^ 



fc-1 



. E E E[/i(a)(S n j+/i n , w y n j+r n ,fc)-hC2)(5 TlJ -_ ( j+T nifc )] 2 

\ fe=lj=fe-d+l 

= 0(1) + 0(0- 
Using condition (6.4), we obtain that 

\A^ 2) \ = 0(n- 1 d6 d ). 

From the monotonicity and summability of the sequence (9k)k£N, it follows that 
d9d — ► 0. Furthermore, the relation 

d — »oo 

<A»)=0(n-»/ a ) 
is obvious. Again, for sufficiently large d, these upper estimates yield that 



E A 

*;=i 



(1,2) 
k 



< - for all n > n(e). 
- 3 - w 



Finally, we obtain, in complete analogy to the calculations above, that 

n 

E A ^ } — o. 



fe=l 



which completes, in conjunction with (6.10) and (6.11), the proof of (6.9). 



(6.11) 

(6.12) 

□ 



7. Proofs of auxiliary lemmas and main results 

Proofs of some of our main results are given in this section, while, for some others, 
we stress only the essentials. More details, as well as the proofs of Lemma 2.1 and 
Theorem 4.1, which are omitted in the sequel, are given in Neumann and Paparoditis 
(2005). 

There has been much effort made in the literature to prove stochastic cquicontinuity, 
often as a sufficient condition for tightness, of families of multivariate processes. For a 
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family of processes (X n ) n ^ with sample paths in C([0, l] 9 ), one seeks to show that, for 
all 6 > 0, r\ > 0, there exists an e > such that 

P{u e {X n ) > 6) < v Vn>n , (7.1) 

where the modulus of continuity is defined as lo £ (x) = supu s _ t n <E \x(s) — x{t)\. For a 
family of processes on R 9 , one can either transform them to processes on [0, l] q or one 

can alternatively show that, for all S > 0, r\ > 0, there exists a grid Q = {&q , . . . , } x 

• • • x {6 , . . . , b$} with — oo = 6 r ' < • • • < b^J = oo, r = 1, . . . ,q, such that 

P( max sup \X n (t)-X n (b(i))\>S)<ri, (7.2) 

yi<»i,...,*,<Af 6(i-i 9 )^t^6(i) / 

for all n> uq. Here, and in the following, we use the notation i= (ii, . . . ,i q ), t = 
(h,...,t q ) and &(i) = (4 1) ,...,6g ) ). 
It has been shown, for example, in Pollard (1984), Theorem 3 in Section V.l, that 

(7.2) and the weak convergence of finite-dimensional distributions of X n to those of 
a process X with continuous sample paths together imply that (X n )„ S N converges in 
distribution (with respect to the suprcmum norm) to X. (Pollard actually proves this for 
processes on [0,1]; the extension to processes on R q is, however, obvious.) An obstacle 
to proving (7.2) arises since the supremum over an infinite set is involved. Therefore, 
one often proves, instead of (7.2), that there exists a sequence of increasingly fine grids, 

Gn = {t£l • ■ ■ , 4!k } x • • • x , • • ■ . t% n } , such that g C Qn and 

Pi max sup \X n (t)-X n (b(i))\>6)<Ti (7.3) 

\l<ll,---,lq<M te g n . b (t_ tq ^ t ^ b (£ J 

holds for n > no, and then derives (7.2) by continuity and monotonicity arguments. With 
a slight abuse of terminology, we also call property (7.2) stochastic equicontinuity and 

(7.3) stochastic equicontinuity on the grids Q n . 

In the following, we provide a simple proof of (7.3) based on an anisotropic dyadic 
tiling of the space. Such an anisotropic tiling has previously been used in the proof of 
Proposition 7.3 in Rio (2000), page lOOff. This proof constitutes an alternative to the 
commonly used approach based on Bickel and Wichura's (1971) fluctuation result for 
their modulus of continuity M" , and to an approach based on an isotropic tiling of the 
space proposed by Neuhaus (1971). 

Lemma 7.1. Let (X n (t)) t ^Rq be a sequence of real-valued stochastic processes. For any 
hyperrectangle B — (si,ii] x • • • x (s q ,t q ], the increment of X n around B is given by 

X n {B)= J2 (-l) 9 ~ (6l+ - +e ' ) ^ l (si+ei(ti-si),...,s 9 + e 9 (^- Sg )). 

(ei,...,£,)G{0,l} <! 

We suppose that there exists a sequence of measures (/x n )neN on {R. q ,B q ) with continuous 
marginals which converges weakly to a finite measure fj,, also having continuous marginals, 
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and 

mn(B)] 4 < W{B) + Cn-^ (7.4) 

for all hyperrectangles B and some 7 > 0, C < 00. Let 8 and r\ be arbitrary positive 
constants. 

There then exists a coarse grid Q = {b^\ . . . , b£j '} x • • ■ x {bg 9 \ . . . , b$} with —00 = 
6q < ■ ■ ■ < b^j = 00 (r = 1, . . . , q) and a sequence of fine grids Q n = {t„\, ■ ■ ■ , i„^f n } x • • ■ 

X ■ ■ ■ ,tn\l n } with Q Q Qn Such that 

Uni^"- 1 x (t^ k _ 1 ,t^ k ]xW- r )<2^)n- 1 \/k = l,...,M n ,\/r = l,...,q (7.5) 
and 

P[ max max \XJt) - XJb(i))\ > 5 ) < n (7.6) 

\l<ii,...,i q <Mt€9 n -b(i-t q )^t<b(£) J 
holds for all n>no 7 where no is sufficiently large. 

Remark 7.1. As already mentioned in the discussion after Theorem 3 of Bickel and Wichura 
(1971), it is possible (and, for the bootstrap processes, important) that the measures \x n 
are allowed to depend on n. The term n~ q in (7.4) cannot be avoided in our context; see, 
for example, (7.15) in the proof of Theorem 3.1 below. Because of this term, we obtain 
stochastic equicontinuity in a first step only on a grid with cardinality of = 0(n q ). 
In our applications, stochastic equicontinuity over the full space will then follow from 
monotonicity and continuity properties of the processes involved; see step (i) in the proof 
of Theorem 3.1 below. 

Proof of Lemma 7.1. (i) Dyadic systems of grid points. First, we define dyadic systems 
of grid points. At the coarse scales, their choice is tied to the measure /i. Let F ^ be the 
rth marginal cumulative distribution function of pL, that is, 

F {r \x)=n(W- 1 x (-oo,x] xl rr ). 

For an appropriate Jo € N to be determined in part (iii) of this proof, we define, for 
r = l,...,q and < j < J , 

{—00, if k = 0, 

FW _1 (fc2-^(E9)), i£l<k<Z>, 
00, iffe = 2J. 

We choose M = 2 Jo and b[ r) = b { J^ k . 

At the finer scales, with index j > Jq, the grid points are chosen according to ji n . We 
set 

F^{x)=^ n {W- 1 x {-oo,x] x W*- r ) 
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and choose, for j = Jq + 1, . . ., J„ with 2 J ™~ 1 < n < 2 J ™, grid points as follows. For I £ 

b (r) = b (r) ( 



{0, . . . , 2 J <> - 1} and k £ {1, • . . , 2'- J °}, we define b^L^,. = b^L Jg ,Jn) such that 



Fi r \b% 2 ._ Jo+k (n)) = FM(& J0)I ) + -^-[F r W(fe Jo , i+1 ) - F«(6 /o ,0]. 

(Again, we set (n) = — oo and bfy (n) = oo.) We set M„ = 2 J " + 1 and = 6^ , . . . , 

^n,M„ = bj 2Jn (r = l,...,q). That is, the fine grids are given as Q n = Gn x • • • x 

where ^C {6 M ,...,6« 2Jn }. 

Since fi n yu, we have, for all r = 1, . . . , q, I = 1, . . . , 2 Jo and n > no with no sufficiently 
large, that 

MM'- 1 x x R 9_r ) < 2 A1 (M'- 1 x (&« X R 9 ^) = 2 1 -*>n(R»). 

This implies that 

MM'- 1 x (6J3_ 1 ,6^] x R 9 ~ r ) < 2 1 "V(M^) (7.7) 

for all r = 1, . . . , q, j = Jo + 1, • ■ • , J n i k = 1, . . . , 2 J and n > no, that is, (7.5) is satisfied. 

(ii) A probabilistic bound for the increments of X n . To simplify notation, in the sequel, 
we use multiindices j = (ji,...,j g ) and k= (fci, . . . , k q ). For (j,k) from the set B n = 
{(jiM) : < < 2 Jn , 1 < k r < 2 Jr Vr}, we define the hyperrcctanglc 

B jk = (b {1 \ ,,b (1 \ ] x • • • x (b (q \ ,,b (q \ }. 

We choose any a £ (0,7/4) and define the thresholds 

Xj = K2~ a( - j ' L+ '" +jq ' > / q , (7.8) 

where K will be chosen below. From (7.7) and 2 J ™ < 2n we obtain that 

»n(Bj,k) + Cn' q < d min {2~ jr } < Ci2- tfl+ - +J *«>/« V(j, fc) G B„. 

Therefore, we obtain, by Markov's inequality, that 

\Un(Bj k) +n- q ] 1+ i 
P(lX4%,)l>A,)< ^ 2l '- ( . i+ ... + . g)/g 

< (/x n (7i i! fc)+n- 9 )if- 4 C72^ 1+ - + ^)( 4a -^/ 9 . 

This implies that 

P(|X„(S^)| > Aj for any (j, fc) e B„) 
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<(n n {m. q ) + 2 J " q n- q )K- 4 Cl 2^ 1+ - +1 ^ Aa -^/ q (7.9) 

h, — ,jq=0 

<v, 

for n > no, provided the constant K in (7.8) is large enough. 

(iii) Stochastic equicontinuity of X n on the fine grid. Now, assume that 

\X n (B L ,k)\ < X L for all (j,fc) e B n . (7.10) 

Moreover, let t £ Q n and i be such that b(i — t q ) -< t -< b(i). There then exist hyperrectan- 
gles Bj(i) fc (i), . . . ,B-(l) k (L> with different scale indices and (j"\k ) G B n such that 

maxi< r < g jr^ > Jo and 

L 

{x:xdi b{i)} \ {x :x ± t} = [J B^ 0j fc W ■ (7.H) 

i=i 

Accordingly, X„(6(i)) -X„(t) = Y^f=i «,&(')), which implies that 

L j n j n 

\X n (t) - X n (b(i))\ < V> <if«E £ 2-^+-+^)/«. (7.12) 

Now, choosing J eN such that /\q2- Jo "/9(l/(l - 2" a /' ? )) <? < S, we obtain that 

|X„(i)-X„(&(|))|<<5 

holds for all i and t £ Q„ with 6(1 - t q ) ^t^. b(i), whenever (7.10) is fulfilled. This, 
however, implies, in conjunction with (7.9), that (7.6) is satisfied. □ 

Proof of Theorem 3.1. We apply the method of proving weak convergence for pro- 
cesses described by, for example, Wichura (1971), Proposition 1 and Pollard (1984), 
Theorem 3 in Section V.l. To this end, we will prove (i) stochastic equicontinuity of 
(^n)nGN and (ii) weak convergence of the finite-dimensional distributions. From step (i), 
we can identify the prospective limit process U as a centered Gaussian one. Because of 
the complicated covariance function, we cannot immediately see that U possesses a ver- 
sion with continuous sample paths. However, steps (i) and (ii) together imply that there 
is a version of U which inherits the property of stochastic continuity from the processes 
(C/„)„ e N- Having this, it is then easy to conclude that this process has continuous sample 
paths with probability 1. These facts together yield the desired convergence of (C/„)„gN 
to U. 

(i) Stochastic equicontinuity of (U n )neN- We set q = p+l. We will show that there 
exists, for any 5 > and 77 > 0, a grid Q = {by, . . . , b]$} x • • • x {6 9 \ . . . , b$} with 
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-oo = b { a r) < b[ r) < ■ ■ ■ < = oo (r = 1, . . . , q) such that (with b(i) = {b^\. . . , &{«>)') 

P[ max sup \U n {t)-U n (b(i))\ > 5) <rj. (7.13) 

\l<»i.— ,*ar<Af tgR , :6 ^_ : | Lj ) ; <tX6(i) / 

We decompose U n (x,y) as 

U n (x, y) = -= V 7(X t _! < x)[I(X t <y)- F E (w(X t . u y, 9))} 

+ 4= it J ( X '-i ^ x)[F e (w(X t _ liy , 9)) - F e (w(X t -i,y, ?))] 

7.14 

1 ™ ~ 
+ — V J(X t _ x =< ^[^(X^, y, 9)) - F^wiXt^y, 9))} 

= :R^(x,y) + R^(x,y) + R^(x,y), 

say. It is now most convenient to prove stochastic equicontinuity for Rn^ , R„^ and 
Rn^ separately. We will give all details for and refer to Neumann and Paparoditis 

(2) (3) 

(2005) for more details regarding i?„ and R n . For any hyperrectangle B = B x x B y = 
(si , ti] x • • • x (s q , t q ] , denote by 

1 ™ 

R<p (B) = - r Yl / ( Xt ~ 1 e B x )[I{X t e B v) - p ( x t e B a I x *-i)] 

* n i=l 

the increment of ii„ around £?. We will first show that, for all 7 e (0, 1/q), there exists 
some constant C 7 < 00 such that 

E[R^(B)f < C 1 [P^-^ Xt {B)+n~"] 1 +^. (7.15) 

Let g B (X t -i,X t ) = I(X t -i & B x )[I{X t e B y ) - P(X t e B y | X t _i)]. Note that, for suffi- 
ciently integrable random variables Y\, . . . , Y4 with EF^ = 0, the relations 

EFi ■ ■ ■ y 4 = EFiF 2 • Ey 3 y 4 + cov(y 1 F 2 , r 3 F 4 ) 
= cov(yi,r 2 TO) 

= cov(Yi Y 2 Y 3 ,Yi) 

hold true. Since gs is a bounded function with Egs(Xt_i,Xt) = 0, we consequently 
obtain that 



E[i?«(B)] 4 <4 E |E[to(X tl „ 1 ,X tl )--- 5B (X t4 „ 1 ,X t 



4! 

tl<t2<t3<«4 
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(7.16) 



<4!< 



Yi-l 



£a, 2 (£) 



r=0 



Tl-l 



5> + l) 2 C r , 4 (B», 



r=0 



where 



C r , q {B)= max sup | cov(g B (X tl _i, X tl ) ■ ■ ■ g B (X tm _ l7 X tm ), 

i<m<9-i(t 1 ,...,t ( ,)er r , 9 (m) 

ffs(X tm+1 _i,X tm+1 )- ■•j B (X t i,I f ))| 



and 



%, q {m) = <^ (ti, . . . ,i„) :1 < tx < . . . < t g < n, max {t J+1 - tj} = t m - t m +i = r\. 

{ i<j<g— l J 

(Inequality (7.16) is similar to inequality (2.14) in Doukhan and Louhichi (1999), the 
only difference being that the '3' is absent there.) 
For small values of r, we use the simple estimate 



C r , q (B)<E\g B (X t _ 1 ,X t )\<2P Xt - l ' Xt (B) 



(7.17) 



For large values of r, we intend to exploit the weak dependence of the process (Xt)tei, m 
order to show that C r . q {B) gets small as r increases. Since g B is not Lipschitz, we define 
smooth approximations to g B , 

9B,e(x,y) = J We(u)g B +u(x,y)du, 

where (u> £ ) e >o is a family of nonncgativc functions with supp(u> c ) C {u = (ui, . . . , itp+i) : m > 
and \\u\\ h < e}. / w e (u)du = 1 and ||w e ||oo < Ct~ q . Since hvp(g B ^{u 1 ) ■ ■■gB,e(u m )) < 
m ■ Lip(gs,e) — 0(l/e), we obtain, by (2.2) or (2.6), respectively, that 



|cov(ffB i£ (X tl _i, • ■•fl , B, e (X tm _i,Xt m ), 

5s,e(X tm+1 _i, X tm+1 ) ■ ■ ■ g Bje (X tl _i,X tq ))\ 



(7.18) 



Since \x± ■ ■ -x q — y\ ■ ■ •y q \ < J2i=i \ x i ~ Vi\ for all real numbers Xi,yi G [—1,1] and, by 
Lipschitz continuity of Fx, 



E\g B (Xt-i,Xt) ~ 9B A^t-i,X t )\ < Ce, 



wc obtain that 



I cov{g B (X tl -i,X tl ) • ■■g B (X tm -i,X tm ), 
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h m+1 -i,X tm+1 ) ■ ■ ■ g B (X tl _i, X tq j) 
- cov( ffB , e (X tl _i,X tl ) • ■■5B, e (Xt m _i I -X-t m ), (7-19) 

flS,e(Xt TO+I _l,Xt TO+I )- • •5B,e(Xf 9 _i, Xf 9 ))| 

<Ce. 

From (7.17) and (7.18) and (7.19) with e = // r ~P)/ 2 , we obtain that 

a,<r(-B)<C(P Xt - 1 ' Xt (S)A / o (r - p)/2 ), 

which implies, by (7.16), inequality (7.15). 

From (7.15), we conclude, by Lemma 7.1, that there exists a coarse grid Q = 
{4 , . . . , 6$ } x • • • x {6^ , . . . , b$ } and a sequence of fine grids Q n = {t^\ ,...,^ f Jx 
• • • x {4 9) i, • ■ • , *i*M„} with S ^ g„ such that 

and 

pf max _ max |j#>(i) -P«(6(i))| > £) < 2 (7.20) 

for all ri > n and n sufficiently large. 

To extend property (7.20) to the whole space, we employ a simple monotonicity argu- 
ment. For t with i^i -i <U< t%\ Vr, we have the inequalities 

P«Ci (1) t (<?) ) 

v n t=l 

- 7(X t _ X X . . . ,*S p „ 1 ))P(X t < 4+^.! | X t _x)] 

<pW(i!,...,g 

1 " 

<^ ) («S 1 .-»«S.)+4E[-]- 

» 4=1 

It follows from the Bernstein-type inequality for weakly dependent random variables, 
from Kallabis and Neumann (2006), that 



for any (ii,...,*,) J < J. (7.21) 
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(7.20) and (7.21) together yield that 



P 



max max 

l<U,...,i a <Af ieR9:6(j-lj, + i)^t^&( 



i) \ R £)(t)-RM(b(i))\>l)<l 



(7.22) 



for all n > uq . 

(2) (3) 

Furthermore, we can also prove property (7.22) for the processes Jin and Rn , possibly 
with other coarse grids Q and Q . This yields property (7.13) for the grid Q, which is the 
combination of the grids Q , Q and Q . 

(ii) Weak convergence of 'finite- dimensional distributions. Let k G N, (xi, j/i), . . . , (xk,Dk) € 
Pxl and ci, . . . , Cfc G R be arbitrary. By the Cramer- Wold device, it suffices to show 
that 



(7.23) 



k / k \ 

^ciU n (xi, yi) — >A/"( 0, cic m T((xi,yi),{x m ,y m )) ■ 

i=l V i,TO=l / 

According to (7.14), we can show that 
1 " 

U n {x,y) = —= y~*[gi(X t -i,X t ;x,y) H \- g 3 (K t -i,X t ;x, y)} + o P {\). 

v t=i 



Moreover, it follows from (2.2)-(2.4) and (2.6)-(2.8), respectively, that the triangular 

scheme (Z n ,t)t=i,...,n with Z„, t = J2i=i c i [9i(^t-i, X t ] xi, yi) H \- g 3 (X t ^i, X t ;x h yi)} 

satisfies conditions (6.1)-(6.4) of the central limit theorem in Section 6. (Note that the 
function gi(-, - ',x,y) is discontinuous; we must use an approximation by a smoothed ver- 
sion, as above, for checking (6.2) to (6.4).) Hence, (7.23) follows immediately from The- 
orem 6.1. 

(iii) The limit process. According to (7.13), there exists a sequence of grids Q^ N ' = 



l u l,0 >■■■> U 1,M N J 



(AT) 



(TV) 



ib { 



' b< q N M N } Witl1 



-no - /) (JV) < ■ ■ ■ < h ( 



oo ( r 



l,...,q) and g( N +V C g( N ) \fN € N such that 



P 



Ki 



max sup 

i,- 1 >8<M» (6S i : s(l»)(i_ lt )^i l (»)(i) 



\U n (t)-U n (bW(i))\>±-)<±- (7.24) 



N 



N 



for n > njv and tin sufficiently large. 

Let =Uw=i^ N '- Kolmogorov's consistency theorem ensures that there exists a 
real-valued stochastic process U on Q(°°> with finite-dimensional distributions 















\U(tk)J 
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It follows from the weak convergence proved in step (ii) that U inherits the continuity 
property from the processes (Z7 ra ) ng N, that is, we have that 

PI max max \U(t) - U{b {N \i))\ > 

\l<i u ...,i q <M N teg( w '):6W(i-l,)^6W(i) N J 

(7.25) 

< — ViV' > N. 
~ N 

By continuity from below of the probability measure P, this implies that even 

PI max max \U(t) - U(b {N Ui))\ > — ) < — (7.26) 

\l<ll,. .. ,i q <M N t£G(^):b( N ){i-t q )^t^<b< N Hl) NJ N 

holds true. 

We now extend the process U to a process U on K 9 . Fix any t € R 9 . There then exists 
a sequence (i [N) ) N eN such that b^ N \i {N) - 1,) r< t r< & W (i (JV) ) • It follows from (7.26) 
that U(b( N \i (N) )) converges in probability. We set 

U(t)= plim U(b (N) {i< N) )). 

It is clear that the process U has the same stochastic continuity property as U , that is, 
max sup l^-C/^Wd))! >!)<!. (7.27) 

This means, in particular, that U has, with probability 1, continuous sample paths. More- 
over, it follows from (7.13), (7.23) and (7.27) that the finite-dimensional distributions of 
U„ converge to those of U. The latter property yields, in conjunction with (7.13) and 

(7.27), by Theorem 3 in Section V.l of Pollard (1984), that U n -±+U. □ 

Proof of Lemma 4.1. First, note that for the function w(-), as it is specified under 
(Al) or (AT), there exists S > such that, for all 9 1 ,6 2 £ U s (5) = {i9 G 9, ||t? - <9|| < 
6} and all compact sets K C R p+1 , \w(x,y,8i) — w(x, y, #2)) < L(x,y)\\8i — 82W with 
sup x yeK L(x,y) < 00. For such a 6 and K < 00, define 

0„ = . . . ,X n ) : ...,*„)- 0|| < 5, E((e*) 2 | Xi_ p , . . . , Jf n ) < ft}. 

Choose K < 00 large enough so that P((Xi_ p , . . . , X„) e f2„) — * 1 as n —* 00. We then 
have, for w„ <E fi n , 

|P(X t *<|/|X?_ 1 =x)-f e (w(a; J y,5))| 
= |P e Hx ! y,?))-F e K a ;,j/,e))| 

<|FeW^y,^))-PeKa;,y,e))| + |P ; £ Ka ; ,y,?))-F e Kx,y ! ?))|. 
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For the first term on the right-hand side of the last inequality above, we obtain, using 
(Bf), that 

sup \F e {w(x,y,6)) - F e (w(x,y,9))\ < ll/ e |U|0- 6\\ supL{x,y)^0 

x£K xGK 

as n — ► oo. The second term can be majorized by \\F e — i^Hoo, which converges to zero 
in probability; see Neumann and Paparoditis (2005) for details. □ 

Proof of Lemma 4.2. Let pW = pYu-,Y k and pW = pY^\„„Y^\ For k = p, 
the proof that (4.4) holds true follows the lines of the proof of Theorem 3.3 in 
Paparoditis and Politis (2002). 

For k < p, the assertion follows from (4.4) with k = p, by the continuous mapping 
theorem. For k > p, the assertion follows by induction from the result for k = p and the 
fact that, for any bounded and uniformly continuous function / :K fc — > R, the relation 

//dPW = / [W(Yt n \... ) Y£ n) )\(Yl n \...,Y£\)' = y) 

- E(f(Y 1 , ...,Y k )\(Y 1 ,..., Y k ^Y = y)]P^ (dy) 

+ I E(/(r 1 ,...,r fc )|(y 1 ,...,y fe _ 1 )' = y)[pf- 1 )(djy)-p( fe - 1 )(dy)] 

fdP {k) 
fdP {k) 

holds true. □ 

Proof of Corollary 4.1. For some null sequence (5 n )n&i and appropriate K < oo, we 
define a set of "favorable events" such that 



tin C {(!!_„, . . .,X n ) : ||0 n (Xi, . . .,X n ) - 0\\ < 5„ 



(7.28) 



E((e*y\X 1 _ p ,...,X n )<K}. 

Moreover, let the VL n be such that, for any sequence (w n )neN with uj n G fl n , 

C(e* t | {X 1 _ p ,...,X n )=Lu n )=>£(e t ). 

The constant K < oo and the sequence {5 n ) n ^ above are chosen such that 6 n — > and 
P((Xi— p , . . . ,X„) G ft n ) ~ * 1 as n — ► oo. Now, let (o;„) ne N be an arbitrary sequence with 
w n G ri n - We now assume that the bootstrap distributions arc taken under the condition 
that (Xi-p, . . . , X n ) = Lo n . (This refers, in general, to a triangular scheme, but not to a 
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single sequence of X t .) Since 6 n (Xi- p , . . . ,X n ) — > 9 and E({e1) 2 \ Xi- P , . . . ,X n ) < K, 

n — >oo 

the conditions of Lemma 4.2 are fulfilled, which yields the assertion. □ 

Proof of Lemma 4.3. We only stress the essentials of the proof. Let (w„) n£ n be any 
sequence with uj n € Q n , where Cl n is chosen as in the proof of Corollary 4.1. Assume that 
the distributions are taken under the condition (A"i_ p , . . . , X n ) = tu n . 

For the AR(p) case, let be the roots of the polynomial #(•) with 9(z) = 

1 — 9\z — ■ ■ ■ — 8 p z p . According to (Al), we have that e := min{|£i|, . . . , — 1 > 0. If 
S n in (7.28) is sufficiently small, then we obtain, for the roots £ nj i, . . . ,£ n ,p of 9(z) = 
1 — B n .\z — ■ ■ ■ — 9n tP z p , that min{|£ nj i|, . . . , |£n.p|} > 1 + e/2; sec Theorem 1.4 in Mardcn 
(1949). Thus, there exists a stationary solution to the equation A" t * = 9 n ^X^_ 1 + • • • + 
9n :P Xt_ p + e* t which can be written as an MA(oo)-process, X% = Y^k=o@ n , k£ t-k> where 
\Pn,k\ < Cs(l + e/2 — 5)~ k for any 5 > and corresponding Cs < oo. The rest of the proof 
then follows that in Section 2.1. 

To show the weak dependence of the bootstrap process in the ARCH(p), choose 6 n < 
rj/p in (7.28), where 77 = 1 - Yn=i^i > °' and notc that ' for a11 ^ = U 5 (9), 
there exists some rj £ (0, 1) such that X)f=i 0i<l — r}. Furthermore, a (unique) station- 
ary solution to the equation X? = et \J 9$ + 9\X^\ + • • • + 9 p X^ p does exist. The result 
then follows by applying the same coupling scheme as in the proof of Lemma 2.1; see 
Neumann and Paparoditis (2005) for details. □ 

Proof of Theorem 4.1. The method of proof is exactly the same as that for Theo- 
rem 3.1. Lemma 4.3 ensures that (X t *) t6 z satisfies appropriate conditions of weak de- 
pendence which yields, in conjunction with the fact that P Xt (B) converges to P Xt (B) 
with a sufficiently fast rate, that (U*)t^z is stochastically cquicontinuous. Convergence of 
the finite-dimensional distributions to a Gaussian limit again follows from Theorem 6.1, 
while the result of Corollary 4.1 ensures that its covariance function is the same as for 
the original process under the null; for details, see Neumann and Paparoditis (2005). □ 
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